Two-Phase Biomedical NE Recognition based on SVMs
نویسندگان
چکیده
Using SVMs for named entity recognition, we are often confronted with the multi-class problem. Larger as the number of classes is, more severe the multiclass problem is. Especially, one-vs-rest method is apt to drop the performance by generating severe unbalanced class distribution. In this study, to tackle the problem, we take a two-phase named entity recognition method based on SVMs and dictionary; at the first phase, we try to identify each entity by a SVM classifier and post-process the identified entities by a simple dictionary look-up; at the second phase, we try to classify the semantic class of the identified entity by SVMs. By dividing the task into two subtasks, i.e. the entity identification and the semantic classification, the unbalanced class distribution problem can be alleviated. Furthermore, we can select the features relevant to each task and take an alternative classification method according to the task. The experimental results on the GENIA corpus show that the proposed method is effective not only in the reduction of training cost but also in performance improvement: the identification performance is about 79.9(Fβ = 1), the semantic classification accuracy is about 66.5(Fβ = 1).
منابع مشابه
Disguised Face Recognition by Using Local Phase Quantization and Singular Value Decomposition
Disguised face recognition is a major challenge in the field of face recognition which has been taken less attention. Therefore, in this paper a disguised face recognition algorithm based on Local Phase Quantization (LPQ) method and Singular Value Decomposition (SVD) is presented which deals with two main challenges. The first challenge is when an individual intentionally alters the appearance ...
متن کاملDiscriminative named entity recognition of speech data using speech recognition confidence
This paper presents a method for the named entity recognition (NER) of speech data that uses automatic speech recognition (ASR) confidence as a feature that indicates whether each word is correctly recognized. An NER model is trained using ASR results with named entity (NE) labels to include an ASR confidence feature as well as corresponding transcriptions with NE labels. Experiments using supp...
متن کاملEfficient Support Vector Classifiers for Named Entity Recognition
Named Entity (NE) recognition is a task in which proper nouns and numerical information are extracted from documents and are classified into categories such as person, organization, and date. It is a key technology of Information Extraction and Open-Domain Question Answering. First, we show that an NE recognizer based on Support Vector Machines (SVMs) gives better scores than conventional syste...
متن کاملComparison of SVMs in Number Plate Recognition
High accuracy and high speed are two key issues to consider in automatic number plate recognition (ANPR). In this paper, we construct a recognition method based on Support Vector Machines (SVMs) for ANPR. Firstly, we briefly review some knowledge of SVMs. Then, the number plate recognition algorithm is proposed. The algorithm starts from a collection of samples of characters. The characters in ...
متن کاملA New Approach for Applying Support Vector Machines in Multiclass Problems Using Class Groupings and Truth Tables
The Support Vector Machines (SVMs) had been showing a high capability of complex hyperplane representation and great generalization power. These characteristics lead to the development of more compact and less computational complex methods than the One-versus-Rest (OvR) and One-versus-One (OvO) [1] classical methods in the application of SVMs in multiclass problems. This paper proposes a new me...
متن کامل